Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 663 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 212.7 KiB |
| Average record size in memory | 328.5 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 6 |
ClusterID is highly overall correlated with Customer_Category and 4 other fields | High correlation |
Customer_Category is highly overall correlated with ClusterID and 6 other fields | High correlation |
Frequency is highly overall correlated with RFM_Sum | High correlation |
Frequency_Score is highly overall correlated with Customer_Category and 1 other fields | High correlation |
ID is highly overall correlated with K-Means Segments and 4 other fields | High correlation |
K-Means Segments is highly overall correlated with ClusterID and 5 other fields | High correlation |
Monetary is highly overall correlated with RFM_Sum | High correlation |
Monetary_Score is highly overall correlated with Customer_Category and 2 other fields | High correlation |
RFM Score is highly overall correlated with ClusterID and 5 other fields | High correlation |
RFM_Sum is highly overall correlated with Frequency and 4 other fields | High correlation |
Recency is highly overall correlated with ClusterID and 4 other fields | High correlation |
Recency_Score is highly overall correlated with ClusterID and 5 other fields | High correlation |
Recency_Score is uniformly distributed | Uniform |
ID has unique values | Unique |
Reproduction
| Analysis started | 2024-03-18 04:07:21.981110 |
|---|---|
| Analysis finished | 2024-03-18 04:09:28.951682 |
| Duration | 2 minutes and 6.97 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
ID
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 663 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1230800.8 |
| Minimum | 1230001 |
|---|---|
| Maximum | 1231797 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 1230001 |
|---|---|
| 5-th percentile | 1230049.1 |
| Q1 | 1230307.5 |
| median | 1230745 |
| Q3 | 1231278 |
| 95-th percentile | 1231690.9 |
| Maximum | 1231797 |
| Range | 1796 |
| Interquartile range (IQR) | 970.5 |
Descriptive statistics
| Standard deviation | 545.5876 |
|---|---|
| Coefficient of variation (CV) | 0.00044327855 |
| Kurtosis | -1.2708628 |
| Mean | 1230800.8 |
| Median Absolute Deviation (MAD) | 485 |
| Skewness | 0.21058599 |
| Sum | 8.1602094 × 108 |
| Variance | 297665.82 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1230001 | 1 | 0.2% |
| 1231119 | 1 | 0.2% |
| 1231123 | 1 | 0.2% |
| 1231126 | 1 | 0.2% |
| 1231129 | 1 | 0.2% |
| 1231134 | 1 | 0.2% |
| 1231135 | 1 | 0.2% |
| 1231137 | 1 | 0.2% |
| 1231139 | 1 | 0.2% |
| 1231146 | 1 | 0.2% |
| Other values (653) | 653 |
| Value | Count | Frequency (%) |
| 1230001 | 1 | |
| 1230002 | 1 | |
| 1230003 | 1 | |
| 1230004 | 1 | |
| 1230005 | 1 | |
| 1230006 | 1 | |
| 1230007 | 1 | |
| 1230008 | 1 | |
| 1230009 | 1 | |
| 1230010 | 1 |
| Value | Count | Frequency (%) |
| 1231797 | 1 | |
| 1231793 | 1 | |
| 1231789 | 1 | |
| 1231788 | 1 | |
| 1231787 | 1 | |
| 1231782 | 1 | |
| 1231781 | 1 | |
| 1231771 | 1 | |
| 1231763 | 1 | |
| 1231760 | 1 |
Monetary
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 445 |
|---|---|
| Distinct (%) | 67.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51316415 |
| Minimum | 60000 |
|---|---|
| Maximum | 4.31943 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 60000 |
|---|---|
| 5-th percentile | 60000 |
| Q1 | 300000 |
| median | 2090000 |
| Q3 | 38046000 |
| 95-th percentile | 4.303797 × 108 |
| Maximum | 4.31943 × 108 |
| Range | 4.31883 × 108 |
| Interquartile range (IQR) | 37746000 |
Descriptive statistics
| Standard deviation | 1.1112209 × 108 |
|---|---|
| Coefficient of variation (CV) | 2.1654297 |
| Kurtosis | 5.8548884 |
| Mean | 51316415 |
| Median Absolute Deviation (MAD) | 2025000 |
| Skewness | 2.6380496 |
| Sum | 3.4022783 × 1010 |
| Variance | 1.2348119 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60000 | 39 | 5.9% |
| 431943000 | 34 | 5.1% |
| 65000 | 17 | 2.6% |
| 85000 | 13 | 2.0% |
| 75000 | 9 | 1.4% |
| 140000 | 7 | 1.1% |
| 220000 | 6 | 0.9% |
| 95000 | 6 | 0.9% |
| 240000 | 6 | 0.9% |
| 320000 | 5 | 0.8% |
| Other values (435) | 521 |
| Value | Count | Frequency (%) |
| 60000 | 39 | |
| 65000 | 17 | |
| 70000 | 2 | 0.3% |
| 75000 | 9 | 1.4% |
| 80000 | 1 | 0.2% |
| 80320 | 1 | 0.2% |
| 85000 | 13 | 2.0% |
| 90000 | 4 | 0.6% |
| 95000 | 6 | 0.9% |
| 100000 | 3 | 0.5% |
| Value | Count | Frequency (%) |
| 431943000 | 34 | |
| 416310000 | 1 | 0.2% |
| 411740000 | 1 | 0.2% |
| 407295000 | 1 | 0.2% |
| 405600000 | 1 | 0.2% |
| 393040000 | 1 | 0.2% |
| 391620000 | 1 | 0.2% |
| 384686000 | 1 | 0.2% |
| 378175000 | 1 | 0.2% |
| 353970000 | 1 | 0.2% |
Frequency
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7119155 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 9 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.9021212 |
|---|---|
| Coefficient of variation (CV) | 1.4388801 |
| Kurtosis | 58.104459 |
| Mean | 2.7119155 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.196475 |
| Sum | 1798 |
| Variance | 15.22655 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 2 | 153 | |
| 3 | 46 | 6.9% |
| 4 | 34 | 5.1% |
| 5 | 24 | 3.6% |
| 7 | 13 | 2.0% |
| 6 | 12 | 1.8% |
| 9 | 9 | 1.4% |
| 8 | 9 | 1.4% |
| 13 | 8 | 1.2% |
| Other values (10) | 18 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 2 | 153 | |
| 3 | 46 | 6.9% |
| 4 | 34 | 5.1% |
| 5 | 24 | 3.6% |
| 6 | 12 | 1.8% |
| 7 | 13 | 2.0% |
| 8 | 9 | 1.4% |
| 9 | 9 | 1.4% |
| 10 | 3 | 0.5% |
| Value | Count | Frequency (%) |
| 53 | 1 | 0.2% |
| 38 | 1 | 0.2% |
| 29 | 1 | 0.2% |
| 26 | 1 | 0.2% |
| 24 | 1 | 0.2% |
| 19 | 2 | 0.3% |
| 15 | 3 | 0.5% |
| 14 | 1 | 0.2% |
| 13 | 8 | |
| 11 | 4 |
Recency
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 298 |
|---|---|
| Distinct (%) | 44.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 276.819 |
| Minimum | 0 |
|---|---|
| Maximum | 706 |
| Zeros | 3 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 110 |
| median | 238 |
| Q3 | 457 |
| 95-th percentile | 629 |
| Maximum | 706 |
| Range | 706 |
| Interquartile range (IQR) | 347 |
Descriptive statistics
| Standard deviation | 198.50706 |
|---|---|
| Coefficient of variation (CV) | 0.71710057 |
| Kurtosis | -1.0457902 |
| Mean | 276.819 |
| Median Absolute Deviation (MAD) | 153 |
| Skewness | 0.46977125 |
| Sum | 183531 |
| Variance | 39405.055 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 111 | 16 | 2.4% |
| 124 | 15 | 2.3% |
| 116 | 15 | 2.3% |
| 72 | 11 | 1.7% |
| 101 | 9 | 1.4% |
| 142 | 9 | 1.4% |
| 109 | 9 | 1.4% |
| 76 | 9 | 1.4% |
| 4 | 7 | 1.1% |
| 35 | 6 | 0.9% |
| Other values (288) | 557 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 4 | |
| 4 | 7 | |
| 6 | 4 | |
| 8 | 1 | 0.2% |
| 9 | 2 | 0.3% |
| 30 | 3 | |
| 31 | 5 | |
| 32 | 3 | |
| 33 | 5 |
| Value | Count | Frequency (%) |
| 706 | 1 | 0.2% |
| 705 | 4 | |
| 702 | 1 | 0.2% |
| 701 | 1 | 0.2% |
| 698 | 3 | |
| 697 | 1 | 0.2% |
| 685 | 1 | 0.2% |
| 683 | 1 | 0.2% |
| 681 | 1 | 0.2% |
| 675 | 1 | 0.2% |
Recency_Score
Categorical
HIGH CORRELATION  UNIFORM 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.7 KiB |
| 3 | |
|---|---|
| 4 | |
| 1 | |
| 2 | |
| 5 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 663 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 133 | |
| 4 | 133 | |
| 1 | 133 | |
| 2 | 132 | |
| 5 | 132 |
Frequency_Score
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.7 KiB |
| 1 | |
|---|---|
| 3 | |
| 5 | |
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 663 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 5 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 3 | 153 | |
| 5 | 127 | 19.2% |
| 4 | 46 | 6.9% |
Monetary_Score
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.7 KiB |
| 1 | |
|---|---|
| 3 | |
| 5 | |
| 4 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 663 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 5 |
| 4th row | 4 |
| 5th row | 5 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 135 | |
| 3 | 133 | |
| 5 | 133 | |
| 4 | 132 | |
| 2 | 130 |
RFM Score
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 88 |
|---|---|
| Distinct (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 327.20513 |
| Minimum | 111 |
|---|---|
| Maximum | 553 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 111 |
|---|---|
| 5-th percentile | 114 |
| Q1 | 212 |
| median | 332 |
| Q3 | 435 |
| 95-th percentile | 515 |
| Maximum | 553 |
| Range | 442 |
| Interquartile range (IQR) | 223 |
Descriptive statistics
| Standard deviation | 136.88237 |
|---|---|
| Coefficient of variation (CV) | 0.41833809 |
| Kurtosis | -1.3123914 |
| Mean | 327.20513 |
| Median Absolute Deviation (MAD) | 118 |
| Skewness | -0.017794061 |
| Sum | 216937 |
| Variance | 18736.783 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 511 | 34 | 5.1% |
| 155 | 27 | 4.1% |
| 411 | 26 | 3.9% |
| 211 | 22 | 3.3% |
| 513 | 22 | 3.3% |
| 512 | 21 | 3.2% |
| 311 | 20 | 3.0% |
| 212 | 18 | 2.7% |
| 515 | 15 | 2.3% |
| 111 | 15 | 2.3% |
| Other values (78) | 443 |
| Value | Count | Frequency (%) |
| 111 | 15 | |
| 112 | 9 | |
| 113 | 5 | 0.8% |
| 114 | 7 | |
| 115 | 5 | 0.8% |
| 131 | 3 | 0.5% |
| 132 | 7 | |
| 133 | 7 | |
| 134 | 5 | 0.8% |
| 135 | 3 | 0.5% |
| Value | Count | Frequency (%) |
| 553 | 1 | 0.2% |
| 552 | 1 | 0.2% |
| 545 | 1 | 0.2% |
| 544 | 1 | 0.2% |
| 543 | 1 | 0.2% |
| 535 | 2 | 0.3% |
| 534 | 6 | |
| 533 | 4 | |
| 532 | 8 | |
| 531 | 4 |
RFM_Sum
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.4313725 |
| Minimum | 3 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.3 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 7 |
| median | 9 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 14 |
| Range | 11 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4882283 |
|---|---|
| Coefficient of variation (CV) | 0.29511545 |
| Kurtosis | -0.70530335 |
| Mean | 8.4313725 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.1742135 |
| Sum | 5590 |
| Variance | 6.1912801 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 96 | |
| 10 | 96 | |
| 11 | 91 | |
| 7 | 88 | |
| 8 | 71 | |
| 6 | 66 | |
| 5 | 46 | |
| 12 | 37 | 5.6% |
| 4 | 31 | 4.7% |
| 13 | 22 | 3.3% |
| Other values (2) | 19 | 2.9% |
| Value | Count | Frequency (%) |
| 3 | 15 | 2.3% |
| 4 | 31 | 4.7% |
| 5 | 46 | |
| 6 | 66 | |
| 7 | 88 | |
| 8 | 71 | |
| 9 | 96 | |
| 10 | 96 | |
| 11 | 91 | |
| 12 | 37 | 5.6% |
| Value | Count | Frequency (%) |
| 14 | 4 | 0.6% |
| 13 | 22 | 3.3% |
| 12 | 37 | 5.6% |
| 11 | 91 | |
| 10 | 96 | |
| 9 | 96 | |
| 8 | 71 | |
| 7 | 88 | |
| 6 | 66 | |
| 5 | 46 |
Customer_Category
Categorical
HIGH CORRELATION 
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.5 KiB |
| New Customers | |
|---|---|
| Promissing | |
| At Risk | |
| Cannot Lose Them | |
| Potential Loyalist | |
| Other values (6) |
Length
| Max length | 21 |
|---|---|
| Median length | 14 |
| Mean length | 13.010558 |
| Min length | 5 |
Characters and Unicode
| Total characters | 8626 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | At Risk |
|---|---|
| 2nd row | Potential Loyalist |
| 3rd row | Loyal |
| 4th row | Need Attention |
| 5th row | Cannot Lose Them |
Common Values
| Value | Count | Frequency (%) |
| New Customers | 114 | |
| Promissing | 102 | |
| At Risk | 89 | |
| Cannot Lose Them | 80 | |
| Potential Loyalist | 75 | |
| Hibernating Customers | 62 | |
| Loyal | 35 | 5.3% |
| Need Attention | 35 | 5.3% |
| About To Sleep | 28 | 4.2% |
| Lost Customers | 28 | 4.2% |
Length
| Value | Count | Frequency (%) |
| customers | 204 | |
| new | 114 | 8.9% |
| promissing | 102 | 8.0% |
| at | 89 | 6.9% |
| risk | 89 | 6.9% |
| cannot | 80 | 6.2% |
| lose | 80 | 6.2% |
| them | 80 | 6.2% |
| loyalist | 75 | 5.9% |
| potential | 75 | 5.9% |
| Other values (9) | 294 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 899 | 10.4% |
| t | 821 | 9.5% |
| o | 785 | 9.1% |
| e | 776 | 9.0% |
| 619 | 7.2% | |
| i | 617 | 7.2% |
| n | 546 | 6.3% |
| m | 401 | 4.6% |
| r | 368 | 4.3% |
| a | 342 | 4.0% |
| Other values (19) | 2452 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6725 | |
| Uppercase Letter | 1282 | 14.9% |
| Space Separator | 619 | 7.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 899 | |
| t | 821 | |
| o | 785 | |
| e | 776 | |
| i | 617 | |
| n | 546 | |
| m | 401 | |
| r | 368 | 5.5% |
| a | 342 | 5.1% |
| u | 232 | 3.4% |
| Other values (9) | 938 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 299 | |
| L | 218 | |
| P | 177 | |
| A | 152 | |
| N | 149 | |
| T | 108 | 8.4% |
| R | 89 | 6.9% |
| H | 62 | 4.8% |
| S | 28 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 619 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8007 | |
| Common | 619 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 899 | |
| t | 821 | 10.3% |
| o | 785 | 9.8% |
| e | 776 | 9.7% |
| i | 617 | 7.7% |
| n | 546 | 6.8% |
| m | 401 | 5.0% |
| r | 368 | 4.6% |
| a | 342 | 4.3% |
| C | 299 | 3.7% |
| Other values (18) | 2153 |
Common
| Value | Count | Frequency (%) |
| 619 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8626 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 899 | 10.4% |
| t | 821 | 9.5% |
| o | 785 | 9.1% |
| e | 776 | 9.0% |
| 619 | 7.2% | |
| i | 617 | 7.2% |
| n | 546 | 6.3% |
| m | 401 | 4.6% |
| r | 368 | 4.3% |
| a | 342 | 4.0% |
| Other values (19) | 2452 |
ClusterID
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 37.7 KiB |
| 1 | |
|---|---|
| 2 | |
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 663 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 2 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 663 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 663 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 374 | |
| 2 | 232 | |
| 0 | 57 | 8.6% |
K-Means Segments
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 48.2 KiB |
| Regular Customers | |
|---|---|
| Inactive Customers | |
| Premium Shoppers |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.263952 |
| Min length | 16 |
Characters and Unicode
| Total characters | 11446 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Regular Customers |
|---|---|
| 2nd row | Regular Customers |
| 3rd row | Premium Shoppers |
| 4th row | Inactive Customers |
| 5th row | Premium Shoppers |
Common Values
| Value | Count | Frequency (%) |
| Regular Customers | 374 | |
| Inactive Customers | 232 | |
| Premium Shoppers | 57 | 8.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| customers | 606 | |
| regular | 374 | |
| inactive | 232 | 17.5% |
| premium | 57 | 4.3% |
| shoppers | 57 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1326 | |
| s | 1269 | |
| r | 1094 | |
| u | 1037 | 9.1% |
| t | 838 | 7.3% |
| m | 720 | 6.3% |
| o | 663 | 5.8% |
| 663 | 5.8% | |
| a | 606 | 5.3% |
| C | 606 | 5.3% |
| Other values (12) | 2624 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9457 | |
| Uppercase Letter | 1326 | 11.6% |
| Space Separator | 663 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1326 | |
| s | 1269 | |
| r | 1094 | |
| u | 1037 | |
| t | 838 | |
| m | 720 | |
| o | 663 | |
| a | 606 | |
| l | 374 | 4.0% |
| g | 374 | 4.0% |
| Other values (6) | 1156 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 606 | |
| R | 374 | |
| I | 232 | 17.5% |
| P | 57 | 4.3% |
| S | 57 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 663 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10783 | |
| Common | 663 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1326 | |
| s | 1269 | |
| r | 1094 | |
| u | 1037 | |
| t | 838 | 7.8% |
| m | 720 | 6.7% |
| o | 663 | 6.1% |
| a | 606 | 5.6% |
| C | 606 | 5.6% |
| R | 374 | 3.5% |
| Other values (11) | 2250 |
Common
| Value | Count | Frequency (%) |
| 663 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11446 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1326 | |
| s | 1269 | |
| r | 1094 | |
| u | 1037 | 9.1% |
| t | 838 | 7.3% |
| m | 720 | 6.3% |
| o | 663 | 5.8% |
| 663 | 5.8% | |
| a | 606 | 5.3% |
| C | 606 | 5.3% |
| Other values (12) | 2624 |
| ClusterID | Customer_Category | Frequency | Frequency_Score | ID | K-Means Segments | Monetary | Monetary_Score | RFM Score | RFM_Sum | Recency | Recency_Score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ClusterID | 1.000 | 0.549 | -0.329 | 0.238 | -0.258 | 1.000 | -0.375 | 0.439 | 0.676 | -0.030 | 0.735 | 0.650 |
| Customer_Category | 0.549 | 1.000 | -0.400 | 0.575 | -0.198 | 0.549 | -0.106 | 0.542 | 0.672 | 0.085 | 0.706 | 0.604 |
| Frequency | -0.329 | -0.400 | 1.000 | 0.350 | -0.442 | 0.224 | 0.409 | 0.098 | -0.159 | 0.686 | -0.357 | 0.162 |
| Frequency_Score | 0.238 | 0.575 | 0.350 | 1.000 | -0.435 | 0.238 | 0.402 | 0.240 | -0.150 | 0.689 | -0.349 | 0.219 |
| ID | -0.258 | -0.198 | -0.442 | -0.435 | 1.000 | 1.000 | -0.153 | 1.000 | -0.515 | -0.585 | -0.399 | 1.000 |
| K-Means Segments | 1.000 | 0.549 | 0.224 | 0.238 | 1.000 | 1.000 | 0.065 | 0.439 | -0.762 | -0.232 | -0.772 | 0.650 |
| Monetary | -0.375 | -0.106 | 0.409 | 0.402 | -0.153 | 0.065 | 1.000 | 0.470 | -0.013 | 0.735 | -0.170 | 0.019 |
| Monetary_Score | 0.439 | 0.542 | 0.098 | 0.240 | 1.000 | 0.439 | 0.470 | 1.000 | -0.018 | 0.731 | -0.172 | 0.046 |
| RFM Score | 0.676 | 0.672 | -0.159 | -0.150 | -0.515 | -0.762 | -0.013 | -0.018 | 1.000 | 0.415 | 0.953 | 0.935 |
| RFM_Sum | -0.030 | 0.085 | 0.686 | 0.689 | -0.585 | -0.232 | 0.735 | 0.731 | 0.415 | 1.000 | 0.195 | 0.261 |
| Recency | 0.735 | 0.706 | -0.357 | -0.349 | -0.399 | -0.772 | -0.170 | -0.172 | 0.953 | 0.195 | 1.000 | 0.867 |
| Recency_Score | 0.650 | 0.604 | 0.162 | 0.219 | 1.000 | 0.650 | 0.019 | 0.046 | 0.935 | 0.261 | 0.867 | 1.000 |
| ID | Monetary | Frequency | Recency | Recency_Score | Frequency_Score | Monetary_Score | RFM Score | RFM_Sum | Customer_Category | ClusterID | K-Means Segments | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1230001 | 905000.000 | 3 | 137 | 2 | 4 | 3 | 243 | 9 | At Risk | 1 | Regular Customers |
| 1 | 1230002 | 4833000.000 | 4 | 258 | 3 | 5 | 3 | 353 | 11 | Potential Loyalist | 1 | Regular Customers |
| 2 | 1230003 | 431943000.000 | 2 | 346 | 4 | 3 | 5 | 435 | 12 | Loyal | 0 | Premium Shoppers |
| 3 | 1230004 | 28800000.000 | 2 | 343 | 4 | 3 | 4 | 434 | 11 | Need Attention | 2 | Inactive Customers |
| 4 | 1230005 | 220728000.000 | 15 | 70 | 1 | 5 | 5 | 155 | 11 | Cannot Lose Them | 0 | Premium Shoppers |
| 5 | 1230006 | 174170000.000 | 9 | 109 | 2 | 5 | 5 | 255 | 12 | At Risk | 1 | Regular Customers |
| 6 | 1230007 | 189442000.000 | 53 | 9 | 1 | 5 | 5 | 155 | 11 | Cannot Lose Them | 0 | Premium Shoppers |
| 7 | 1230008 | 6783000.000 | 24 | 162 | 3 | 5 | 3 | 353 | 11 | Potential Loyalist | 1 | Regular Customers |
| 8 | 1230009 | 220000.000 | 2 | 284 | 3 | 3 | 2 | 332 | 8 | Hibernating Customers | 1 | Regular Customers |
| 9 | 1230010 | 2790000.000 | 5 | 162 | 3 | 5 | 3 | 353 | 11 | Potential Loyalist | 1 | Regular Customers |
| ID | Monetary | Frequency | Recency | Recency_Score | Frequency_Score | Monetary_Score | RFM Score | RFM_Sum | Customer_Category | ClusterID | K-Means Segments | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 653 | 1231760 | 75000.000 | 1 | 0 | 1 | 1 | 1 | 111 | 3 | Lost Customers | 1 | Regular Customers |
| 654 | 1231763 | 2520000.000 | 1 | 58 | 1 | 1 | 3 | 113 | 5 | Cannot Lose Them | 1 | Regular Customers |
| 655 | 1231771 | 1065000.000 | 2 | 48 | 1 | 3 | 3 | 133 | 7 | At Risk | 1 | Regular Customers |
| 656 | 1231781 | 810000.000 | 1 | 50 | 1 | 1 | 2 | 112 | 4 | Lost Customers | 1 | Regular Customers |
| 657 | 1231782 | 680000.000 | 1 | 50 | 1 | 1 | 2 | 112 | 4 | Lost Customers | 1 | Regular Customers |
| 658 | 1231787 | 65000.000 | 1 | 44 | 1 | 1 | 1 | 111 | 3 | Lost Customers | 1 | Regular Customers |
| 659 | 1231788 | 60000.000 | 1 | 44 | 1 | 1 | 1 | 111 | 3 | Lost Customers | 1 | Regular Customers |
| 660 | 1231789 | 500000.000 | 1 | 44 | 1 | 1 | 2 | 112 | 4 | Lost Customers | 1 | Regular Customers |
| 661 | 1231793 | 140000.000 | 1 | 41 | 1 | 1 | 1 | 111 | 3 | Lost Customers | 1 | Regular Customers |
| 662 | 1231797 | 1000000.000 | 1 | 41 | 1 | 1 | 3 | 113 | 5 | Cannot Lose Them | 1 | Regular Customers |